SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: LONZA AG 

(B) STREET: Muenchensteinerstrasse 38 

(C) CITY: Basle 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE: 4002 

(ii) TITLE OF INVENTION: Process for the preparation of (S)- or (R)-3,3,3-trifluoro-2-hydroxy-2-methylpropionic acid



(iii) NUMBER OF SEQUENCES: 14 

(iv) COMPUTER-READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: Genomic DNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGIN: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(vii) PROVENANCE:

(B) CLONE(S): pPRS2a



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(197..1181)

(D) OTHER INFORMATION: /product= "amidase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CCCGGGAACT CCATGTGGCC GTGATCCTGG TCGAGCAGGA TATTGCGATG ATCCAGCGGG 60

CCGCACAGCG CTGTGCGGTA ATGGATAAAG GCCTGGTTGT AGAAACGCTG ACCCAACAAC 120 

AGCTCTCTGA TGATCTTTTA ATGCGTCGTC ATCTGGCTCT GTAACTAAAC GCTATAAATT 180 

ACGTGGAGAA TAACAT ATG AAA TGG TTG GAA GAA TCC AT? ATG GCC AAA 229
Met Lys Trp Leu Glu Glu Ser Ile Met Ala Lys
1 5 10

CGC GGT GTT GGT GCC GGG CGT AAA CCG GTA ACG CAT CAC CTG ACG GAA 277
Arg Gly Val Gly Ala Gly Arg Lys Pro Val Thr His His Leu Thr Glu
15 20 25



GAA ATG CAA AAA GAG TTT CAT TAG ACC ATT CGC CCT IM TCC ACA CCC 325 
Glu Met Gln Lys Glu Phe His Tyr Thr Ile Gly Pro Tyr Ser Thr Pro
30 35 40 

GTC CTG ACC ATC GAA CCC GGT GAC CGG ATT ATT GTC GAC ACT CGA GAT 373 
Val Leu Thr Ile Glu Pro Gly Asp Arg Ile Ile Val Asp Thr Arg Asp
45 50 55

GCT TTT GAA GGT GCT ATC AAT TCG GAA CAG GAT ATT CCG AGC CAG TTG 421
Ala Phe Glu Gly Ala Ile Asn Ser Glu Gln Asp Ile Pro Ser Gln Leu
60 65 70 75

CTA AAA ATG CCC TTT CTC AAC CCA CAA AAC GGA CCG ATC ATG GTC AAT 469 
Leu Lys Met Pro Phe Leu Asn Pro Gln Asn Gly Pro Ile Met Val Asn
80 85 90 

GGC GCG GAG AAA GGT GAT GTG CTC GCT GTC TAT ATC GAA TCC ATG TTG 517 
Gly Ala Glu Lys Gly Asp Val Leu Ala Val Tyr Ile Glu Ser Met Leu
95 100 105 

CCC CGC GGC GTT GAT CCC TAC GGC ATC TGC GCC ATG ATT CCG CAT TTT 565 
Pro Arg Gly Val Asp Pro Tyr Gly Ile Cys Ala Met Ile Pro His Phe
110 115 120 

GGC GGA CTG ACC GGG ACC GAC CTG ACG GCC ATG CTC AAT GAT CCG CTG 613 
Gly Gly Leu Thr Gly Thr Asp Leu Thr Ala Met Leu Asn Asp Pro Leu
125 130 135



CCA GAA AAG GTG CGC ATG ATT AAA CTC GAC AGT GAA AAG GTC TAC TGG 661
Pro Glu Lys Val Arg Met Ile Lys Leu Asp Ser Glu Lys Val Tyr Trp
140 145 150 155 

AGC AAA CGC CAT ACG CTT CCC TAT AAA CCC CAT ATT GGC ACC TTG AGC 709
Ser Lys Arg His Thr Leu Pro Tyr Lys Pro His Ile Gly Thr Leu Ser
160 165 170 

GTA TCG CCA GAA ATT GAC TCA ATC AAT TCA CTG ACG CCA GAC AAT CAC 757 
Val Ser Pro Glu Ile Asp Ser Ile Asn Ser Leu Thr Pro Asp Asn His
175 180 185 

GGC GGG AAT ATG GAT GTG CCG GAT ATA GGA CCA GGG AGT ATT ACC TAT 805 
Gly Gly Asn Met Asp Val Pro Asp Ile Gly Pro Gly Ser Ile Thr Tyr
190 195 200 

CTG CCG GTA CGT GCG CCT GGA GGC CGC CTC TTT ATT GGT GAT GCC CAT 853 
Leu Pro Val Arg Ala Pro Gly Gly Arg Leu Phe Ile Gly Asp Ala His
205 210 215 

GCT TGT CAG GGT GAT GGT GAG ATT TGC GGG ACC GCA GTA GAG TTT GCC 901
Ala Cys Gln Gly Asp Gly Glu Ile Cys Gly Thr Ala Val Glu Phe Ala
220 225 230 235 

TCA ATC ACC ACC ATC AAA GTC GAT TTG ATC AAG AAC TGG CAG CTT TCC 949 
Ser Ile Thr Thr Ile Lys Val Asp Leu Ile Lys Asn Trp Gln Leu Ser
240 245 250 

TGG CCA CGA ATG GAG AAT GCC GAA AAT ATT ATG AGT ATT GGC AGT GCA 997 
Trp Pro Arg Met Glu Asn Ala Glu Asn Ile Met Ser Ile Gly Ser Ala
255 260 265

CGT CCG CTG GAG GAT GCG ACG CGA ATT GCA TAT CGC GAC TTA ATT TAC 1045
Arg Pro Leu Glu Asp Ala Thr Arg Ile Ala Tyr Arg Asp Leu Ile Tyr
270 275 280



TGG CTG GTA GAA GAC TTT GGC TTC GAA CAA TGG GAT GCC TAC ATG CTT 1093
Trp Leu Val Glu Asp Phe Gly Phe Glu Gln Trp Asp Ala Tyr Met Leu
285 290 295



CTG ACT CAA TGC GGC AAA GTG CGG CTG GGC AAC ATG GTC GAC CCC AAA 1141
Leu Ser Gln Cys Gly Lys Val Arg Leu Gly Asn Met Val Asp Pro Lys
300 305 310 315



TAC ACC GTT GGC GCG ATG CTG AAC AAA AAC CTG TTA GTT TAGTAGGAAT 1190
Tyr Thr Val Gly Ala Met Leu Asn Lys Asn Leu Leu Val
320 325



AACTAACCGG TGAACATTAC CCGGATGTAG ATCGGCGTAA TGTGTAAGTT CAAACAATCG 1250 

CTATTTTTAA CAGCTAAAGC AGGTGCATAT GGGGCCAGAT ACACCCATCA ATATTGGTTT 1310 

ACTTTACTCC TTCAGCGGAG TGACGGCGGC ACAAGAGTTG TCACAATGGC GCGGAGCAAC 1370 

CCAGGCTATT GCCGAAATTA ATCAAAATGG CGGCATCAAC GGCAGACCAC TCAATGCAAT 1430

TCATTTGGAT CC 1442

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 328 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Lys Trp Leu Glu Glu Ser Ile Met Ala Lys Arg Gly Val Gly Ala
1 5 10 15

Gly Arg Lys Pro Val Thr His His Leu Thr Glu Glu Met Gln Lys Glu
20 25 30

Phe His Tyr Thr Ile Gly Pro Tyr Ser Thr Pro Val Leu Thr Ile Glu
35 40 45

Pro Gly Asp Arg Ile Ile Val Asp Thr Arg Asp Ala Phe Glu Gly Ala
50 55 60

Ile Asn Ser Glu Gln Asp Ile Pro Ser Gln Leu Leu Lys Met Pro Phe
65 70 75 80

Leu Asn Pro Gln Asn Gly Pro Ile Met Val Asn Gly Ala Glu Lys Gly
85 90 95

Asp Val Leu Ala Val Tyr Ile Glu Ser Met Leu Pro Arg Gly Val Asp
100 105 110

Pro Tyr Gly Ile Cys Ala Met Ile Pro His Phe Gly Gly Leu Thr Gly
115 120 125

Thr Asp Leu Thr Ala Met Leu Asn Asp Pro Leu Pro Glu Lys Val Arg
130 135 140

Met Ile Lys Leu Asp Ser Glu Lys Val Tyr Trp Ser Lys Arg His Thr
145 150 155 160

Leu Pro Tyr Lys Pro His Ile Gly Thr Leu Ser Val Ser Pro Glu Ile
165 170 175

Asp Ser Ile Asn Ser Leu Thr Pro Asp Asn His Gly Gly Asn Met Asp
180 185 190

Val Pro Asp Ile Gly Pro Gly Ser Ile Thr Tyr Leu Pro Val Arg Ala
195 200 205

Pro Gly Gly Arg Leu Phe Ile Gly Asp Ala His Ala Cys Gln Gly Asp
210 215 220

Gly Glu Ile Cys Gly Thr Ala Val Glu Phe Ala Ser Ile Thr Thr Ile
225 230 235 240

Lys Val Asp Leu Ile Lys Asn Trp Gln Leu Ser Trp Pro Arg Met Glu
245 250 255

Asn Ala Glu Asn Ile Met Ser Ile Gly Ser Ala Arg Pro Leu Glu Asp
260 265 270

Ala Thr Arg Ile Ala Tyr Arg Asp Leu Ile Tyr Trp Leu Val Glu Asp
275 280 285

Phe Gly Phe Glu Gln Trp Asp Ala Tyr Met Leu Leu Ser Gln Cys Gly
290 295 300

Lys Val Arg Leu Gly Asn Met Val Asp Pro Lys Tyr Thr Val Gly Ala
305 310 315 320

Met Leu Asn Lys Asn Leu Leu Val
325

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Lys Trp Leu Glu Glu Ser Ile Met Ala Lys Arg Gly Val Gly Ala
1 5 10 15

Ser Arg Lys Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Val Tyr Trp Ser Lys
1 5

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Lys Pro Val Thr His His Leu Thr Glu Glu Met Gln Lys
1 5 10

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Tyr Thr Val Gly Ala Met Leu Asn Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known



(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Glu Asn Ala Glu Asn Ile Met Ser Ile Gly Ser Ala Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Trp Leu Glu Glu Ser Ile Met Ala Lys
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Pro Phe Leu Asn Pro Gln Asn Gly Pro Ile Met Val Asn Gly Ala
1 5 10 15

Glu Lys 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Ala Phe Glu Gly Ala Ile Asn Ser Glu Gln Asp Ile Pro Ser Gln
1 5 10 15 

Leu Leu Lys 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu Phe His Tyr Thr Ile Gly Pro Tyr Ser Thr Pro Val Leu Thr Ile
1 5 10 15

Glu Pro Gly Asp Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
Leu Phe Ile Gly Asp Ala His Ala Glu Gln Gly Asp Gly Glu Ile Glu
1 5 10 15

Gly Thr Ala Val Glu Phe Ala

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Asp Val Leu Ala Val Tyr Ile Glu Ser Met Leu Pro Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(C) INDIVIDUAL/ISOLATE: PRS1

(vii) PROVENANCE:

(B) CLONE(S): PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gly Val Asp Pro Tyr Gly Ile Glu Ala Met Ile Pro His Phe Gly Gly
1 5 10 15

Leu Thr Gly Thr Asp Leu Thr Ala Met Leu Asn Asp Gln Leu Gln Pro
20 25 30

Lys



